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ABSTRACT 

We consider the massive MIMO downlink with time-division du¬ 
plex (TDD) operation and conjugate beamforming transmission. To 
reliably decode the desired signals, the users need to know the ef¬ 
fective channel gain. In this paper, we propose a blind channel es¬ 
timation method which can be applied at the users and which does 
not require any downlink pilots. We show that our proposed scheme 
can substantially outperform the case where each user has only sta¬ 
tistical channel knowledge, and that the difference in performance 
is particularly large in certain types of channel, most notably key¬ 
hole channels. Compared to schemes that rely on downlink pilots 
(e.g., fll ). our proposed scheme yields more accurate channel esti¬ 
mates for a wide range of signal-to-noise ratios and avoid spending 
time-frequency resources on pilots. 

Index Terms — Blind channel estimation, downlink, massive 
MIMO, time-division duplex. 

1. INTRODUCTION 

Massive multiple-input multiple-output (MIMO) is one of the most 
promising technologies to meet the demands for high throughput 
and communication reliability of next generation cellular networks 
(2H3. In massive MIMO, time-division duplex (TDD) operation is 
preferable since then the pilot overhead does not depend on the num¬ 
ber of base station antennas. With TDD, the channels are estimated 
at the base station through the uplink training. For the downlink, un¬ 
der the assumption of channel reciprocity, the channels estimated at 
the base station are used to precode the data, and the precoded data 
are sent to the users. To coherently decode the transmitted signals, 
each user should have channel state information (CSI), that is, know 
its effective channel from the base station. 

In most previous works, the users are assumed to have statisti¬ 
cal knowledge of the effective downlink channels, that is, they know 
the mean of the effective channel gain and use this for the signal 
detection < 61171 . In these papers, Rayleigh fading channels were as¬ 
sumed. Under the Rayleigh fading, the effective channel gains be¬ 
come nearly deterministic (the channel “hardens”) when the number 
of base station antennas grows large, and hence, using the mean of 
the effective channel gain for signal detection works very well. How¬ 
ever, in practice, propagation scenarios may be encountered where 
the channel does not harden. In that case, using the mean effective 
channel gain may not be accurate enough, and a better estimate of 
the effective channel should be used. In m, we proposed a scheme 
where the base station (in addition to the beamformed data) also 
sent a beamformed downlink pilot sequence to the users. With this 
scheme, a performance improvement (compared to the case when 
the mean of the effective channel gain is used) was obtained. How¬ 
ever, this scheme requires time-frequency resources in order to send 

This work was supported in part by the Swedish Research Council (VR) 
and ELLIIT. 


the downlink pilots. The associated overhead is proportional to the 
number of users which can be in the order of several tens, and hence, 
in a high-mobility environment (where the channel coherence inter¬ 
val is short) the spectral efficiency is significantly reduced. 

Contribution: In this paper, we consider the massive MIMO 
downlink with conjugate beamformingQ We propose a scheme with 
which the users blindly estimate the effective channel gain from the 
received data. The scheme exploits the asymptotic properties of the 
mean of the received signal power when the number of base station 
antennas is large. The accuracy of our proposed scheme is investi¬ 
gated for two specific, very different, types of channels: (i) indepen¬ 
dent Rayleigh fading and (ii) keyhole channels. We show that when 
the number of base station antennas goes to infinity, the channel es¬ 
timate provided by our scheme becomes exact. Also, numerical re¬ 
sults quantitatively show the benefits of our proposed scheme, espe¬ 
cially in keyhole channels, compared to the case where the mean of 
the effective channel gain is used as if it were the true channel gain, 
and compared to the case where the beamforming training scheme 
of [D is used. 

Notation: We use boldface upper- and lower-case letters to de¬ 
note matrices and column vectors, respectively. The superscripts () T 
and Q H stand for the transpose and conjugate transpose, respec¬ 
tively. The Euclidean norm, the trace, and the expectation opera¬ 
tors are denoted by || • ||, Tr(-), and E {-}, respectively. The no- 
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tation —> means convergence in probability, and -A' means almost 
sure convergence. Finally, we use z ~ CJV (0, cr 1 2 ) to denote a cir¬ 
cularly symmetric complex Gaussian random variable (RV) a with 
zero mean and variance <r 2 . 

2. SYSTEM MODEU 

Consider the downlink of a massive MIMO system. An M- antenna 
base station serves K single-antenna users, where M K 
1. The base station uses conjugate beamforming to simultaneously 
transmit data to all K users in the same time-frequency resource. 
Since we focus on the downlink channel estimation here, we assume 
that the base station perfectly estimates the channels in the uplink 
training phase. (In future work, this assumption may be relaxed.) 
Denote by gk the M x 1 channel vector between the base station 
and the fcth user. The channel gk results from a combination of 
small-scale fading and large-scale fading, and is modeled as: 

g k = \ffh hfc, (1) 

where /3k represents large-scale fading which is constant over many 
coherence intervals, and hfc is an M x 1 small-scale channel vector. 
We assume that the elements of hfc are i.i.d. with zero mean and unit 
variance. 

1 We consider conjugate beamforming since it is simple and nearly opti¬ 

mal in many massive MIMO scenarios. More importantly, conjugate beam¬ 
forming can be implemented in a distributed manner. 




Let Sk, E {|Sfc| 2 } = 1, k = 1,... ,K, be the symbol intended 
for the fcth user. With conjugate beamforming, the M x 1 precoded 
signal vector is given by 

x = -y/aGs, (2) 

where s = [si, S 2 , ■ ■ ■, sk] T , G = [gi ... gx] is an Mx K channel 
matrix between the K users and the base station, and a is a normal¬ 
ization constant chosen to satisfy the average power constraint at the 
base station: 

E {II X H 2 } = P- 

Hence, 

“ = E {Tr (GG ff )} ' (3) 

The signal received at the fcth user is 

yk = gf x + n k = Vagk Gs + nk 

K 

= Va\\Sk\\ 2 s k + \fa ^2 SkSk'Sk' + n k , (4) 

k'^k 

where n k ~ CM (0,1) is the additive Gaussian noise at the fcth user. 
Then, the desired signal Sk is decoded. 

3. PROPOSED DOWNLINK BLIND CHANNEL 
ESTIMATION TECHNIQUE 

The kth user wants to detect Sk from yk in 0. For this purpose, it 
needs to know the effective channel gain ||gfc|| 2 . If the channel is 
Rayleigh fading, then by the law of large numbers, we have 

1 ii i |2 o 

M l gfc l 

as M —>■ oo. This implies that when M is large, ||gfc || 2 at Mfa (we 
say that the channel hardens). So we can use the statistical properties 
of the channel, i.e., use E {||gfc|| 2 } = M/3k as a good estimate of 
||gfc|| 2 when detecting Sk- This assumption is widely made in the 
massive MIMO literature. However, in practice, the channel is not 
always Rayleigh fading, and does not always harden when M —> 
oo. For example, consider a keyhole channel, where the small-scale 
fading component hfc is modeled as follows 181191 : 

hk=u k hk, (5) 


of m, we need to spend at least 50 symbols for sending pilots. As 
a result, less than 50 of the 100 downlink symbols are used for pay- 
load in each coherence interval, and the insertion of the downlink 
pilots reduces the overall (uplink+downlink) spectral efficiency by a 
factor of 1/4. 

In what follows, we propose a blind channel estimation method 
which does not require any downlink pilots. 

3.1. Mathematical Preliminaries 

Consider the average power of the received signal at the fcth user 
(averaged over s and n*). From 0. we have 

K 12 

E {l2/fc| 2 } = Q llgfc|| 4 + « jsfgfe'l + 1 - (6) 

k'^k 

The second term of 0 can be rewritten as 
k K 

\ H H H ~H A ~ 

^ / J |Sfc gfc'I ^ v Sfc'SfcSfc Sfc' (') 

k'^k k'^k 

where gfc = [gf ... gf_i gf + i . .. gx] T , and A is an M(I< - 
1) x M(K — 1) block-diagonal matrix whose (i,i)-block is the 
M x M matrix gfcgjf. Since A and g k are independent, as M(K — 
1) —> oo, the Trace lemma gives fTOl 

1 K i K 

M(K-l) ^ l Sfc gfc 'l “ M(K — 1) ^ ^ °- 

v ' k’^tk v ' k’^k 

( 8 ) 

Substituting 0 into 0. as M(K — 1) —> oo, we have 

E {M 2 } 1 ( „ „4 , ^ 0 „ 112 , A 

M(K — 1) ~ M(K — 1) a||gfc|1 + a l^M\Sk\\ +1 

\ ^ J 

“-*0. (9) 

The above result implies that when M and K are large, 

K 

E {|t/fc| 2 } « a||gfe|| 4 + a ^2 fa’\\gk\\ 2 + 1. (10) 

k'y£k 

Therefore, the effective channel gain ||gk|| 2 can be estimated from 
E {li/fc | 2 } by solving the quadratic equation <10t . 


where v k and the M elements of are i.i.d. CM (0,1) RVs. For 
the keyhole channel 0. by the law of large numbers, we have 

^yllgfcll 2 —/ 3 fcM 2 4 o, 

which is not deterministic, and hence the channel does not harden. 
In this case, using E {|| gfc II 2 } = Mfa as an estimate of the true 
effective channel ||gfc || 2 to detect Sk may result in poor performance. 

For the reasons explained, it is desirable that the users estimate 
their effective channels. One way to do this is to have the base sta¬ 
tion transmit beamformed downlink pilots as proposed in QJ. With 
this scheme, at least K downlink pilot symbols are required. This 
can significantly reduce the spectral efficiency. For example, sup¬ 
pose M = 300 antennas serve K = 50 terminals, in a coherence 
interval of length 200 symbols. If half of the coherence interval is 
used for the downlink, then with the downlink beamforming training 


3.2. Downlink Blind Channel Estimation Algorithm 

As discussed in Section [30 we can estimate the effective channel 
gain ||gfc || 2 by solving the quadratic equation d 1 0| . It is then required 
that the fcth user knows a, YLk'^k fa 1 , and E {|t/fe| 2 }- We assume 
that the fcth user knows a and "Ylrk'^k fa'- This assumption is rea¬ 
sonable since the terms a and fa' depend ° n the large-scale 

fading coefficients, which stay constant over many coherence inter¬ 
vals. The fcth user can estimate these terms, or the base station may 
inform the kth user about them. Regarding E {| yk | 2 }. in practice, it 
is unavailable. However, we can use the received samples during a 
whole coherence interval to form a sample estimate of E {|J/fc| 2 } as 
follows: 


E {M 2 }^^ M1)|2+M2) i! + - +Mr) ' 2 . (H) 


T 










Fig. 1. Normalized MSE versus SNR for different channel estima- Fig. 2. Normalized MSE versus SNR for different channel estima¬ 
tion schemes, for Rayleigh fading channels. tion schemes, for keyhole channels. 


where yk{n) is the nth receive sample, and T is the length (in sym¬ 
bols) of the coherence interval used for the downlink transmission. 
The algorithm for estimating ||gfc || 2 is summarized as follows: 


Algorithm 1 (Proposed blind downlink channel estimation method) 

1. Using a data block ofT samples, compute £k as CEB- 

2. The channel estimate o/||gfc|| 2 , denoted by at, is determined as 


dk - 


~ a E k' 9 tkPk' + ^a 2 (E k'^kPv) +4a(£ fc -l) 


2 a 


( 12 ) 


Note that ak in d!2b is the positive root of the quadratic equation: 
= aa k + a Ek'^k Pk' a k + 1 which comes from d 1 Ob and ( II lb . 


3.3. Asymptotic Performance Analysis 

In this section, we analyze the accuracy of our proposed scheme for 
two specific propagation environments: Rayleigh fading and keyhole 
channels. For keyhole channels, we use the model {3). We assume 
that the fcth user perfectly estimates E {| J/fc 1 2 } - This is true when 
the number of symbols of the coherence interval allocated to the 
downlink, T, is large. In the numerical results, we shall show that 
the estimate of E {| j/fc | 2 } in dill is very close to E {| z/fc | 2 } even for 
modest values of T (e.g. T ~ 100 symbols). With the assumption 
= E {|r/fc| 2 }, from (J6]l and 03. the estimate of ||gfc|| 2 can be 
written as: 


We can see from ( TT3l l that if |efc| < 


Ylk'^k @k' .11 II2 

—\ -+ fell 


then ak ~ ||gfc|| ■ In order to see under what conditions |efc| <C 
P > \ ^ 

k'+k k , '|g fc || 2 j > we consider Qk which is defined as: 


^k'^k P k' . 
2 ' 


Qk = E ■ 


tk / E \ o E + IlSfc 


k'^fik 



(15) 


Hence, 


Qk= < 


M(M+i)p£ E Pi 

k' ^k 
K 

:E 

k' = 1 

0 K 

6M(M+l)/3g E Pi, 

k 1 ^k 
K 

E 

k' = 1 


■j, for Rayleigh fading channels, 


-7, for keyhole channels, 


(16) 


where p k ^ Ek'^k Pk’- The detailed derivations of are pre- 
sented in the Appendix. We can see that Qk = 0(1/M 2 ). Thus, 

when M 1, |efc| is much smaller than 

As a result, our proposed channel estimation scheme is expected to 
work well. 



E Pk’ 

a k =---b 


\ 


Efe'^/t Py 


+ fell 


+ e k , (13) 


where 


4. NUMERICAL RESULTS 

In this section, we provide numerical results to evaluate our proposed 
channel estimation scheme for finite M. As performance metric we 
consider the normalized mean-square error (MSE) at the fcth user: 


Cfc 


A H 

/ j | Sfe § k' 

k'^k 



(14) 


MSE fc = E 


a k — ||gfc || 
E {llgfcll 2 } 



(17) 































For the simulation, we choose M = 100, K — 20, and Pk = 
l,Vfc = 1 We define SNR = p. Figures |T| and [2] show 

the normalized MSE versus SNR for Rayleigh fading and keyhole 
channels, respectively. The curves labeled “without channel esti¬ 
mation, use E {||gfc|| 2 }” represent the case when the fcth user uses 
the statistical properties of the channels, i.e., it uses E {||gfc|| 2 } as 
estimate of ||gfc|| 2 . The curves “DL pilots [I]” represent the case 
when the beamforming training scheme of (T) with MMSE channel 
estimation is applied. The curves “proposed scheme (Algorithm 1)” 
represent our proposed scheme for different T (T = oo implies that 
the fcth user perfectly knows E {\yk | 2 })• For the beamforming train¬ 
ing scheme, the duration of the downlink training is K. For our 
proposed blind channel estimation scheme, Sk, k = 1 ,,K, are 
random 4-QAM symbols. 

We can see that in Rayleigh fading channels, the MSEs of the 
three schemes are comparable. Using E {||gfc II 2 } in lieu of the true 
||gfc|| 2 for signal detection works rather well. Flow ever, in keyhole 
channels, since the channels do not harden, the MSE when using 
E {||gfcII 2 } as estimate of ||gfc|| 2 is very large. In both propagation 
environments, our proposed scheme works very well. For a wide 
range of SNRs, our scheme outperforms the beamforming training 
scheme, even for short coherence intervals (e.g., T = 100 symbols). 
Note again that, with the beamforming training scheme of m , we ad¬ 
ditionally have to spend at least K symbols on training pilots (this is 
not accounted for here, since we only evaluated MSE). By contrast, 
our proposed scheme does not requires any resources for downlink 
training. 


5. CONCLUDING REMARKS 


Massive MIMO systems may encounter propagation conditions 
when the channels do not harden. Then, to facilitate detection of the 
data in the downlink, the users need to estimate their effective chan¬ 
nel gain rather than relying on knowledge of the average effective 
channel gain. We proposed a channel estimation approach by which 
the users can blindly estimate the effective channel gain from the 
data received during a coherence interval. The approach is compu¬ 
tationally easy, it does not requires any resource for downlink pilots, 
it can be applied regardless of the type of propagation channel, and 
it performs very well. 

Future work may include studying rate expressions rather than 
channel estimation MSE, and taking into account the channel esti¬ 
mation errors in the uplink. (We hypothesize, that the latter will not 
qualitatively affect our results or conclusions.) Blind estimation of 
/3k by the users may also be addressed. 


6. APPENDIX 


Here, we provide the proof of ( 116b . From ( 1151 ). we have 


Qk = E {|efc| 2 } / E 


£ Pk' + ||gfc| 

fc'^fc 



( 18 ) 


• Rayleigh Fading Channels: 


For Rayleigh fading channels, we have 

, / \ 2v 

[ / , K \ I 


y fiw +iig*! i 




k'^k 


ik'^k 




+ I £ Pv) E {llgfcll 2 } + E {IlgA-ll 4 } 

\k'^k ) 

/ \ 2 


= 1 £ py\ +MPk £ py + plM 2 , (19) 

\js'^k J k' = 1 

where the last equality follows ED Lemma 2.9]. We next compute 
E {|efc| 2 }. From fUl . we have 


K 


E{M 2 } = E< £ gfgfc' > + £&>' E {llgfcll 4 } 


t fcVfc 


2 


- 2 £ fa’ e i £ g"g fc ' iigfc 




k'^k 


We have, 


E < I £ jgfgfc' 


= E j llgfcll 4 ( £ 

Vfc'^fc 


( 20 ) 


, ( 21 ) 


where zy — s „' . Conditioned on g^, zy is complex Gaussian 

distributed with zero mean and variance pk' which is independent of 
gfc. Thus, zy ~ CM (0, Py) and is independent of g^. This yields 

E |^£|g^sfc'| j | = e{ iig fc || 4 }e<[( y m 2 ) } 

( K K K ^ 

£tf + ££AA 

%z£k x^k j^k j 

Similarly, 


ik'^k 


( 22 ) 


E ) £ jgfgfc'j llgfcll 2 > = E {iigfc-ii 4 } E l £ i-fc'i 2 \ 
[k'^k J [k'^k J 

K 

= PkM{M + 1) £ Py- (23) 

k'^k 

Substituting d22t . J23] ). and E {||g/e|| 4 } = /3^M(M + 1) into f20l , 
we obtain 


K 

E {|e fc | 2 } =M{M + l)pl £ P 2 y. (24) 

k'^k 

Inserting GD and 03 into GHJ. we obtain GE for the Rayleigh 
fading case. 

• Keyhole Channels: 

By using the fact that 

Zy ~ Xnf = P5) 

llgfcll llgfcll 

is the product of two independent Gaussian RVs, and following a 
similar methodology used in the Rayleigh fading case, we obtain 
for keyhole channels. 
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